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ABSTRACT 


In the modern era, shopping malls and Big Marts maintain a comprehensive record of their sales data for each individual item. This 
data is utilized to predict future customer demand and update inventory management. These records are stored in data warehouses, 
containing a vast amount of customer data and item attributes. By mining this data, anomalies and frequent patterns can be identified. 
The resulting information can be leveraged to forecast future sales volume using various machine learning techniques, specifically 
for retailers like Big Mart. In this study, we propose a predictive model that utilizes the XG boost Regressor technique to forecast sales 
for companies similar to Big Mart. It one the affordable method when comparative to other methods Our findings indicate that this 


model outperforms existing models in terms of performance. 
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INTRODUCTION 

The Big Mart, a prominent supermarket chain with nationwide 
presence, has issued a challenge to all Data Scientists. The 
challenge entails developing a model capable of accurately 
predicting the sales of each product in every store. In order to 
accomplish this, Big Mart has obtained sales data from Kaggle, 
encompassing a wide range of products across multiple stores in 
different cities. By leveraging this valuable information, the 
corporation aims to identify the pivotal products and stores that 
significantly impact their sales. This knowledge will enable 
them to implement appropriate strategies to ensure the success 
of their business. Notably, the current machine learning 
algorithm is highly sophisticated, offering a plethora of methods 
to forecast sales for organizations of any nature. This advanced 
algorithm proves particularly advantageous in mitigating the 
challenges associated with low-cost prediction techniques.. 


MATERIALS AND METHODS 

The methods used for sales prediction with XGBoost include 
data preprocessing, splitting the data into training and testing 
sets, training an XGBoost regression model, and evaluating the 
model's performance. Data preprocessing involves steps like 
data cleaning, feature engineering, data scaling and 
transformation, and encoding categorical variables. Splitting the 
data into training and testing sets allows for training the model 
on the training set and evaluating its performance on the testing 
set. The XGBoost model, known for its high performance in 
regression and classification tasks, is trained using the training 
data. Finally, the model's performance is evaluated using 
appropriate metrics such as Mean Absolute Error. 


RESULTS 

The Results section helps us to for better analysis and prediction 
of the sales and enhances the business sales and increases market 
share in the public. The potential benefits of utilizing XGBoost 
for sales prediction in Big Mart are emphasized, with a particular 
focus on its advantages. However, it is essential to underscore 
the importance of conducting further validation. 


The co-relation matrix 


DISCUSSION 

Discussion Sales prediction using XGBoost is a valuable 
application for a retail giant like Big Mart. XGBoost is a 
powerful machine learning algorithm known for its 
effectiveness in regression and classification tasks. When 
applied to sales prediction, it can yield significant benefits for 
the company. By utilizing XGBoost, Big Mart can make data- 
driven decisions based on historical sales data, customer 
behavior, and other relevant information. This enables the 
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company to optimize various aspects of its operations 


CONCLUSIONS 

Nowadays, shopping malls and Big Marts maintain a 
comprehensive record of their sales data for each individual 
item. This data is utilized to predict future customer demand and 
update inventory management. These records are stored in data 
warehouses, containing a vast amount of customer data and item 
attributes. By mining this data, anomalies and frequent patterns 
can be identified. The resulting information can be leveraged to 
forecast future sales volume using various machine learning 
techniques, specifically for retailers like Big Mart. In this study, 
we propose a predictive model that utilizes the XG boost 
Regressor technique to forecast sales for companies similar to 
Big Mart. Our findings indicate that this model outperforms 
existing models in terms of performance. 


The utilization of XGBoost for sales prediction in a retail setting 
like Big Mart is a valuable and data-driven approach. It 
empowers the company to make data-driven decisions, optimize 
operations, and enhance customer satisfaction. It's an essential 
tool for staying competitive and adapting to the dynamic retail 
landscape. 


Sales predictions also aid in pricing strategies. Big Mart can set 
prices dynamically based on expected demand and inventory 
levels, potentially increasing profit margins. 
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The below picture represents the seasonal sales Data 
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The below picture represent the montly average sales data 
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The below picture represents the average sales ina store 


lili 


E-ISSN No: 2454-9916 | Volume: 9 | Issue: 11 | November 2023 


The below represents sales distributions 


Pie chart distribution 
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